CDS

Accession Number TCMCG075C04495
gbkey CDS
Protein Id XP_007052578.1
Location join(37141068..37141485,37142090..37142216,37142321..37142427,37142737..37142880,37143357..37143491,37143595..37143777,37144374..37144552,37144713..37144739)
Gene LOC18614660
GeneID 18614660
Organism Theobroma cacao
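
The Location field above can be cross-checked against the Protein Length field. The sketch below is an illustration, not part of the record: it assumes the coordinates are 1-based and inclusive, and that the join() contains only plain start..end segments with no complement() or partial markers. The helper name segment_lengths is hypothetical. It sums the segment lengths and derives the residue count, treating the last codon as a stop.

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "join(37141068..37141485,37142090..37142216,37142321..37142427,"
    "37142737..37142880,37143357..37143491,37143595..37143777,"
    "37144374..37144552,37144713..37144739)"
)


def segment_lengths(location):
    """Return the length of each start..end segment.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in pairs]


lengths = segment_lengths(LOCATION)
cds_length = sum(lengths)            # total coding nucleotides, stop codon included
residues = cds_length // 3 - 1       # codons minus the terminal stop codon

print(lengths)     # [418, 127, 107, 144, 135, 183, 179, 27]
print(cds_length)  # 1320
print(residues)    # 439
```

The eight segments total 1320 nt, or 440 codons. Removing the stop codon leaves 439 residues, which matches the stated protein length.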

Protein

Length 439 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007052516.2
Definition PREDICTED: SURP and G-patch domain-containing protein 1-like protein isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category L
Description SURP and G-patch domain-containing protein 1-like
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03041
KEGG_ko ko:K13096
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGAAAGGAGTGCCATCTAGCCTTTTTGTTAATGATGGTTCCTTCATGGAGAGGTTTAAACAGCTTCAACAACAGAAGGATGAAAAAGACAAAGCTGCTGCTGCCTTAGAGGAATCTAAACCCCCCAAAATCGTTAAAGGGTCTTCAGCTCCCAAGCCTGCTATTGCTCTTAACAAAATTTCCATGGATTTTAAGCACAATGATGCACGCAAGACCTCCCAAACTTCTTCTGGGGGCAAACTTGCATTCAGCTTGAAACAGAAGTCAAAGCTTGTGGCACCTCCTGTTAAGTTGGCTGCAGACGAGGATGAAGAGGACCAAGATGCAGGAAAGTTGTCAGATGACACACCCGTAAAGCGGCAAAAGTTGTGTCAAGCAGATACCTCCGAACTAGCATCAAAACAAGTGGATGTTGCACTACCTTCCCCAAGTGATCCCAATGTGAAGAAAGTTGCAGACAAACTAGCAAGTTTTGTTGCCAAAAATGGAAGGCAGTTTGAGCATATTACACGGCAAAAAAACCCTGGAGACACACCTTTTAAATTCCTTTTTGATGAGAGCTGTTCTGATTACAAATACTATGAATTCCGGCTTGCTGAAGAGGAAAAAGCTCTTGTACAGAACAAGGAATCTCAAACTCCTCAAAGTGGTGGTATGAGCTTTTCAGCTACTAAGTCCACAAGCAGCTCCCTTAGGTCAGGTCTGCAGCAATCAAGTTATCAAATGCCTGCCTCTGCTTTGTATGAGAATAATGAGGAGCCTAGATCTTCTGCGATGTCAGCAGGAAGAGCAGGTTCATCCAGTGCTCCAACAGGTGCAGATCCTATAGCAATGATGGAGTTTTACATGAAGAAGGCTGCTCAGGAAGAGAAGATGAGACTGCCTAAGCAGTCCAAAGATGAGATGCCTCCACCTCCTTCCCTTCAAGGAGCTCCTTTGAAGAAAGGTCATCACATGGGTGATTATATCCCACCAGAAGAGCTTGAAAAGTTTTTGGCTGCCTGCAACGATGCTGCTGCTCAAAAAGCTGCACGGGAGACTGCAGAGAAGGCAAAGATTCAATCTGATAATGTTGGGCATAAACTTTTGTCAAAAATGGGTTGGAAAGAAGGTGAGGGTTTAGGGGGCTCCAGAAAGGGTATTTCAGATCCGATCATGGCTGGTGATGTAAAGATGAACAATTTGGGGGTTGGTGCTCATCATCCTGGAGATGTGACTGCAGAGGATGATATATATGAGCAGTATAAGAAACGGATGATGCTTGGTTATCGATACAGACCAAATCCTCTGAACAATCCTCGAAAGGCATACTATTGA
Protein:  
MEKGVPSSLFVNDGSFMERFKQLQQQKDEKDKAAAALEESKPPKIVKGSSAPKPAIALNKISMDFKHNDARKTSQTSSGGKLAFSLKQKSKLVAPPVKLAADEDEEDQDAGKLSDDTPVKRQKLCQADTSELASKQVDVALPSPSDPNVKKVADKLASFVAKNGRQFEHITRQKNPGDTPFKFLFDESCSDYKYYEFRLAEEEKALVQNKESQTPQSGGMSFSATKSTSSSLRSGLQQSSYQMPASALYENNEEPRSSAMSAGRAGSSSAPTGADPIAMMEFYMKKAAQEEKMRLPKQSKDEMPPPPSLQGAPLKKGHHMGDYIPPEELEKFLAACNDAAAQKAARETAEKAKIQSDNVGHKLLSKMGWKEGEGLGGSRKGISDPIMAGDVKMNNLGVGAHHPGDVTAEDDIYEQYKKRMMLGYRYRPNPLNNPRKAYY